Setting and maintaining standards in multiple choice examinations: AMEE Guide No. 37.
Abstract
The process of setting a standard when pass/fail decisions have to be made inevitably involves judgment about the point on the test score scale where performance is deemed to be adequate for the purpose for which the examination is set. As with any process which involves human judgment, setting this standard is likely to include a certain degree of error, which may result in some false positive and false negative decisions. The customary practice of maintaining a constant point on the test score scale at which pass/fail separations are made cannot be justified, as examinations vary in difficulty. The aim of standard setting procedures is to minimize such errors while accounting for the varying difficulty of examinations. A standard may be norm-referenced, where it is dependent on the performance of the particular group of examinees, or criterion-referenced, where it is based on predetermined criteria, irrespective of examinee performance. Where certification of competence is the primary purpose of an examination, the latter is preferred as the decision to be made is whether an individual is competent to practise rather than competent compared to peers. Several methods of standard setting have been used, some of which are based solely on predetermined criteria, while others compromise between norm- and criterion-referenced standards. This guide examines the more commonly used methods of standard setting, illustrates the procedure used in each with the help of an example, and discusses the advantages and disadvantages associated with the use of each. The common errors made by judges in the standard setting process are pointed out and the manner in which judges should be selected, trained and instructed emphasized. A method used for equating similar tests set at different times with the intention of maintaining standards from one examination to the next is illustrated with an example. 
Finally, the guide proposes a practical method for arriving at a predetermined standard by the proportionate selection of test items of known relative difficulties in relation to minimally competent examinees.
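The distinction the abstract draws between norm-referenced and criterion-referenced standards can be made concrete with a small sketch. The abstract does not name a specific method, so both rules below are assumptions chosen for illustration: the norm-referenced cut is taken as the cohort mean minus `k` population standard deviations, and the criterion-referenced cut uses an Angoff-style calculation in which each judge estimates, per item, the probability that a minimally competent examinee answers correctly. The function names and sample data are hypothetical.

```python
from statistics import mean, pstdev


def norm_referenced_cut(scores, k=1.0):
    """Illustrative norm-referenced rule (an assumption, not the guide's method):
    the pass mark is the cohort mean minus k population standard deviations,
    so it moves whenever the cohort's performance moves."""
    return mean(scores) - k * pstdev(scores)


def criterion_referenced_cut(judge_estimates):
    """Illustrative Angoff-style rule (an assumption, not the guide's method).

    judge_estimates[j][i] is judge j's estimate of the probability that a
    minimally competent examinee answers item i correctly. The pass mark is
    the sum over items of the mean estimate, independent of any cohort."""
    n_items = len(judge_estimates[0])
    item_means = [mean(judge[i] for judge in judge_estimates) for i in range(n_items)]
    return sum(item_means)


# Hypothetical data: eight examinee scores and two judges rating four items.
cohort_scores = [2, 4, 4, 4, 5, 5, 7, 9]
estimates = [
    [0.5, 0.75, 1.0, 0.25],
    [0.5, 0.25, 1.0, 0.75],
]

norm_cut = norm_referenced_cut(cohort_scores)
criterion_cut = criterion_referenced_cut(estimates)
```

Under these assumptions, changing `cohort_scores` shifts `norm_cut` but leaves `criterion_cut` untouched, which is the property that makes a criterion-referenced standard preferable when the decision is about competence to practise rather than standing among peers.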
Similar resources
Setting and maintaining standards in multiple choice examinations
Some time ago, I wrote a paper about the assessment of competence in which I argued that any assessment situation is inevitably a compromise between what is desirable and what is achievable (Van der Vleuten, 1996). There are no fixed and firm strategies that guarantee the perfect compromise. Things strongly depend on specific assessment contexts and local conditions. The challenge is to take a ...
The Impact of Correction for Guessing Formula on MC and Yes/No Vocabulary Tests' Scores
A standard correction for random guessing (cfg) formula on multiple-choice and Yes/No examinations was examined retrospectively in the scores of the intermediate female EFL learners in an English language school. The correction was a weighting formula for points awarded for correct answers, incorrect answers, and unanswered questions so that the expected value of the increase in test score due to g...
Developing questionnaires for educational research: AMEE Guide No. 87
In this AMEE Guide, we consider the design and development of self-administered surveys, commonly called questionnaires. Questionnaires are widely employed in medical education research. Unfortunately, the processes used to develop such questionnaires vary in quality and lack consistent, rigorous standards. Consequently, the quality of the questionnaires used in medical education research is hi...
Quality analysis of multiple choice questions (MCQs) examinations of noncontinuous undergraduate medical records
Introduction: There are different methods for evaluating students. One of the most common is multiple choice questions (MCQs). If properly designed, they are a good way to measure student knowledge. Because the use of MCQs is expanding, this study was designed to review the quality of the multiple choice question exams taken by medical records students of Hormozgan University of Medical Sci...
Diagnostic reference levels (DRLs) for routine X-ray examinations in Lorestan province, Iran
Background: In diagnostic radiology there are two reasons for measuring or estimating radiation doses to patients. Firstly measurements provide a means for setting and checking standards of good practice as an aid to the optimization of patient protection. Secondly estimates of the absorbed dose to tissue and organs in the patients. Materials and Methods: A total of 2382 patients were studied t...
Journal:
- Medical Teacher
Volume 32, Issue 5
Publication date: 2008